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Suppose that there are n bins, and balls arrive in a Poisson pro- 
cess at rate An, where A > is a constant. Upon arrival, each ball 
chooses a fixed number d of random bins, and is placed into one 
with least load. Balls have independent exponential lifetimes with 
unit mean. We show that the system converges rapidly to its equilib- 
rium distribution; and when d>2, there is an integer-valued function 
md{n) — lnlnn/lnd + 0(l) such that, in the equilibrium distribution, 
the maximum load of a bin is concentrated on the two values rrid(n) 
and rrid(n) — 1, with probability tending to 1, as n — * oo. We show 
also that the maximum load usually does not vary by more than a 
constant amount from lnlnn/lnd, even over quite long periods of 
time. 

1. Introduction. Balls-and-bins processes have been useful for modeling 
and analyzing a wide range of problems, in discrete mathematics, computer 
science and communication theory, and, in particular, for problems which 
involve load sharing, see, for example, [4, 5, 12, 15, 16, 17, 22]. Here is one 
central result, from [3]. Let d be a fixed integer at least 2. Suppose that 
there are n bins, and n balls arrive one after another: each ball picks d bins 
uniformly at random and is placed in a least loaded of these bins. Then 
with probability tending to 1 as n — > oo, the maximum load of a bin is 
lnlnn/lnd + 0(1). 

In some recent work, balls have been allowed to "die," see [3, 7, 21], 
which is, of course, desirable when modeling telephone calls. For example, 
suppose that we start with n balls in n bins: at each time step, one ball 
is deleted uniformly at random, and one new ball appears and is placed in 
one of d bins as before. It is shown in [3] that, as n — ► oo, at any given time 
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t > cn 2 In Inn, with probability tending to 1, the maximum load of a bin is 
at most lnlnra/ hid + O(l). 

The results mentioned above all concern discrete time models, where at 
each time step a ball may arrive or a ball may die and be replaced by a new 
one. Here we analyze a simple and natural continuous time "immigration- 
death" balls-and-bins model. We concentrate on the maximum bin load, 
which may be the quantity of greatest interest, for example, in load-sharing 
models. 

The scenario we consider is as follows. Let d be a fixed positive integer, 
say d = 2. Let n be a positive integer and suppose that there are n bins. 
Balls arrive in a Poisson process at rate An, where A > is a constant. 
Upon arrival, each ball chooses d random bins (with replacement), and is 
placed into a least loaded bin among those chosen. (If there is more than one 
chosen bin with least load, the ball is placed in the first such bin chosen.) 
Balls have independent exponential lifetimes with unit mean. This process 
goes on forever. 

This model was first studied by Turner in [21], who considers weak conver- 
gence, for a suitable choice of state space. (Also, [19, 20] contain a discussion 
of the completeness of the state space under the product topology.) Turner 
shows that (with appropriate assumptions on the initial distribution), for 
each fixed non-negative integer k the fraction of bins with load at least k, 
converges weakly as n — > oo to a deterministic function v(t,k) defined on 
M + x Z + , where the vector (v(t,k) :k 6 Z + ) is the unique solution to the 
system of differential equations for k = 1,2,..., 

(1) ^ = X(v(t, k-l) d - v{t, kf) - k(v{t, k) - v{t, k + 1)), t>0, 

subject to v(t, 0) = 1 for all t > 0, and appropriate initial values (v(0, k):k£ 
Z + ) such that 1 > v (0, k) > v(0, k + 1) > for all fc£N. The weak-convergence 
result applies only to fixed-index co-ordinates (i.e., fixed values of k) over 
fixed-length time intervals, and yields no information on the speed of con- 
vergence. Our approach is different, and we are not concerned with weak 
convergence, although weak convergence could be deduced from our results. 
The key step is to establish concentration results, which apply to the frac- 
tion of bins with load at least k at time t (where k,t need not be fixed); 
these concentration results may then be used to analyze a balance equation 
involving these quantities. We are thus able to handle random variables like 
the maximum load, over long periods of time. 

For each time t > and each j = 1, . . . , n, let Xt (j) be the random number 
of balls in bin j at time t, and let X t be the load vector (X t (l), . . . ,X t (n)). 
Thus, the total number of balls \X t \ at time t is given by \X t \ = J21=i Xt(j)- 
We shall always assume that the initial load vector Xq satisfies E[|Xo|] < 
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oo. Note that \Xt\ follows a simple immigration-death process, and so its 
stationary distribution is the Poisson distribution Po(An) with mean An. 

It is easy to check that, for given d and n, the load vector process (X t ) is 
Markov, with state space (Z + ) n . Standard results show that there is a unique 
stationary distribution II; and, whatever the distribution of the starting 
state Xq, the distribution of the load vector Xt at time t converges to II as 
t — > oo. Indeed, this convergence is very fast, as our first theorem will show. 

For x E Z n , let = De the L\ norm of x. (Thus, we have 

\X t \ = \\X t \\i.) We use C{X) to denote the probability law or distribution of 
a random variable X. The total variation distance between two probability 
distributions fi± and U2 may be defined by d^v^i, ^2) = inf Pr (X 7^ Y), 
where the infimum is over all couplings of X and Y, where C(X) = \x\ and 
C(Y) = fj,2- Equivalently, 

*rv(A»l, = max | Pr (X G A) - Pr (Y e A)\, 

where the maximum is over all suitable sets A. We also use the Wasserstein 
distance, defined by dwO^ij/^) =infE[||X — Y\\i], where the inf is over 
couplings of X and Y as above. For distributions [i\ and U2 on Z n , we have 
<i T v("i,«2) < d w (/xi,/i 2 )- 

Theorem 1.1. Let d and n be positive integers, and let II be the cor- 
responding stationary distribution for the load vector. Suppose that initially 
the balls are arbitrarily distributed over the bins, with E[|Xo|] < 00. Then 
for each time t>0, 

d TV (C(X t ),n) < d w (C(X t ),U) < (An + E[|X |])e-*. 

For each e > and initial state x, the mixing time r(e,x) is defined by 
considering (Xt), where Xq = x a.s. and setting 

t(s, x) = inf{t > : d TY (C(X t ), U) < e}. 

[Recall that dTy(£(Xt),H) is a nonincreasing function of t.] Thus, for ex- 
ample, if denotes the state with no balls, then the above theorem shows 
that 

r(e,0) <ln(An/e). 

This upper bound on the mixing time is, in fact, of the right order, in 
that r(i,0) = 6(lnn), as we shall see after the proof of Theorem 1.1 by 
considering the behavior of the total number of balls present. For mixing 
results on related models, see [4, 7]: mixing appears to be slower when balls 
live forever. 

As we commented earlier, our primary interest is in the maximum load 
of a bin. Let Mt = maxj Xt(j) be the maximum load of a bin at time t. 
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Thus, M t = H-Xtlloo, where || x||(x) is the infinity norm maxj|xj| of x. The 
above theorem shows that we can essentially restrict our attention to the 
stationary case, at least if we are interested in times well beyond Inn, so 
let us now consider that case. We may write M instead of Mt when the 
system is in equilibrium. The behavior of the maximum load Mt or M is 
very different in the two cases d = 1 and d>2. This is the "power of two 
choices" phenomenon — see, for example, [17]. For clarity, let us write x[ n ^ 

and M t or here to indicate that there are n bins. 

The most interesting case is when d > 2 (indeed, when d = 2), but in order 
to set things in context, let us first consider the (much easier) case when 
d = 1. Suppose then that d = 1. We shall see that is concentrated on 

two values m = m(n) and m — 1, which are close to Inn/ In Inn; and that 
over a polynomial length interval of time, we meet only small (constant 
size) deviations below m but we meet large deviations above m, so that 
the maximum value of M t over an interval of length n K is usually about 
{K + l)m. We use the phrase asymptotically almost surely (a.a.s.) to mean 
"with probability — > 1 as n — > oo." 

(n) 

Theorem 1.2. Let d=l, and suppose that Xq is in the stationary 
distribution (and thus so is M^ for each time t). 

(a) There exists an integer-valued function m = m(n) ~ ) n "" w such that 
a.a.s. M^ n > is m(n) or m(n) — 1. 

(b) For any constant K > 0, 

min M+ n ^ > m(n) — 3 a.a.s. 

0<t<nK 

(c) For any constant K > 0, 



max Mf I — ► K + 1 in probability as n — > oo. 

o<t<n K J Inn 

The notation m = m(n) ~ ppb above means that m(n) = (1 + o(l))p^- 
as n — > oo. It is straightforward to determine m(n) more precisely from the 
proof of the theorem: for example, we have 

Inn . ... (Inn) (In In Inn) 

m n = — — + 1 + o l ^ r^. 

In Inn (mmn) z 

(n) 

Now we consider the case d > 2, when the maximum load Mf> is far 
smaller. Once again, it is concentrated on two values nid = md(n) and — 1, 
but now these numbers are close to In Inn/ In d. This corresponds to the 
behavior of the maximum load in discrete time models; see, for example, 
[3, 4, 12, 16], but is more precise. 
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(n) 

Theorem 1.3. Let d > 2 be fixed, and suppose that Xq is in the 
stationary distribution. Then there exists an integer- valued function = 
md(n) = lnlnn/lnci + 0(1) such that is orm^ — X a.a.s. Further, 

for any constant K > 0, there exists c = c(K) such that 

(2) max \M+ n ^ — lnlnra/lnd| < c a.a.s. 

0<t<n K 

(n) 

The lower bound on M t , in fact, holds over longer intervals than stated 
in (2) above. For example, there is a constant c such that 

(3) min{M t (n) :0<t<e nl/4 }>lnlnn/ln<i-c a.a.s. 

However, the upper bound in (2) does not extend to much longer intervals. 
For example, if K > and r = n Kdlnlnn ^ then 

(4) max M+ n ^ > K In In n a.a.s. 

0<t<T 

The plan of the rest of the paper is as follows. After giving some pre- 
liminary results in the next section, we consider mixing times and prove 
Theorem 1.1. Then we consider the easy case d = 1 when there is one ran- 
dom choice, and prove Theorem 1.2. In order to prove Theorem 1.3, where 
d>2, we need some preliminary results, which are presented in the next 
three sections. First, in Section 5 we give a concentration result for Lips- 
chitz functions of the load vector in equilibrium. In Section 6 we use balance 
equations to establish the key equation (26) concerning the expected pro- 
portion u{i) of bins with load at least % in equilibrium. This result, together 
with the concentration result, yields a recurrence for u(i). After that, in 
Section 7 we consider random processes like a random walk with "drift." 
Then we are ready to prove Theorem 1.3 in Section 8: we first prove upper 
bounds, then lower bounds, and finally we prove the results (3) and (4). 
Last, we briefly consider chaoticity and make some concluding remarks. 

2. Preliminary results. In this section we give some elementary results 
which we shall need several times below. A standard inequality for a binomial 
or Poisson random variable X with mean ^ is that 

(5) Pr(\X-fi\ >e/i) < 2exp(-|e 2 /i) 

for < e < 1 (see, e.g., Theorem 2.3(c) and inequality (2.8) in [14]). Also, 
for each positive integer k, 

(6) Pr (X > k) < fi h /k\ < {en/k) k . 

If X has the Poisson distribution with mean //, let us write X ~ Po(pi): for 
such a random variable, we have 

(7) E[Xl ix > k) ]=»Pr(X>k-l). 
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Next we give an elementary lemma which we shall use later in order to 
extend certain results, for example, concerning the maximum load Mt from a 
single point in time to an interval of time. It yields bounds on the maximum 
and minimum values of a suitable function f(x) over a time interval [0,r]. 

Consider the n-bin case, with set = (Z + ) n of load vectors. Let us say 
that a real-valued function / on fl has bounded increase if whenever s and 
t are times with s < t, then f(xt) is at most f(x s ) plus the total number 
of arrivals in the interval (s,i\; and / has strongly bounded increase if f(xt) 
is at most f(x s ) plus the maximum number of arrivals in the interval (s,t] 
which are placed in any one bin. Thus, for example, f(x) = \x\ has bounded 
increase, and f(x) = maxj x(j) has strongly bounded increase. 

Lemma 2.1. Let (Xt) be in equilibrium. Let s,r > and let a,b be 
non-negative integers. Suppose that (a) / has bounded increase and 5 = 
Pr (Po(Ans) > 6+1), or (b) / has strongly bounded increase and 5 = nPr (Po(AgLs) > 
6+1). In both cases we have 



(8) Pr [f(X t ) < a for some t £ [0, r]] < - + 1 (Pr (f(X ) <a + b) + 5) 



(9) Pr [f(X t ) >a + b for some t € [0, r]] < - + 1 (Pr (f(X ) >a) + S). 



Proof. Consider first the case (a) when / has bounded increase. Note 
that the j = |_-J + 1 disjoint intervals [(r — l)s,rs) for r = 1, .. . ,j cover 
[0, t\. Let B r denote the event of having in total at least 6+1 arrivals in the 
interval [(r — l)s,rs), so that Pr (B r ) = Pr [Po(Ans) > 6 + 1] = 5. Then 



and (9) follows. To handle the case (b) when / has strongly bounded in- 
crease, note that if C r denotes the event of having at least 6 + 1 arrivals in 
the interval [(r — l)s,rs) which are placed into a single bin, then Pr (C r ) < 
nPr [Po(A ds) > 6+ 1]; and then proceed as above. □ 

As we noted earlier, in equilibrium the distribution of the total number of 
balls in the system is Po(An). We close this section by using the last lemma 
to establish a result that will enable us to "control" the total number of 
balls in the system over long periods of time. 




and 




{f(X t ) < a for some t G [0,r]} C |J {f(X rs ) < a + 6} U \J B r 
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Lemma 2.2. For any < e < 1, there exists (3 > such that the following 
holds. Consider an n-bin system, and let (Xt) be in equilibrium. Then a.a.s. 
for all <t < e@ n , the number of balls \Xt\ satisfies 

(l-e)Xn< \X t \ < (l + e)An. 

Proof. By inequality (5), since \X t \ ~Po(An), we have 
Pr (\\X t \ - Xn\ > eXn/2) <2e- e2Xn/12 

and 

Pr [Po(eXn/A) > eXn/2] < 2e' eXn/12 . 

Let (5 satisfy < f3 < ^e 2 A. We use case (a) of Lemma 2.1. Let s = e/4 and 
b = eXn/2: we may now use (8) with a = (1 — e)Xn and (9) with a = (1 + 
e/2)An. 

□ 

3. Rapid mixing: proof of Theorem 1.1. We shall couple (Xt) and a 
corresponding copy (Yt) of the process in equilibrium in such a way that 
with high probability \\Xt — Yt\\i decreases quickly to 0. We assume that 
the choices process always generates a nonempty list of bins at an arrival 
time, and the new ball is placed in a least-loaded bin among those chosen, 
breaking ties if necessary by choosing the first least-loaded bin in the list. 
In the meantime we make no other assumptions about the arrivals process 
or the choices process. We assume as before that balls die independently at 
rate 1, independently of the other two processes. 

The coupling is as follows. Not surprisingly, we give the two processes the 
same arrivals and choices of d bins. The height of a ball in the system at 
a given time is the number of balls in its bin that arrived before it, plus 
one. Assume that we have a family of independent rate 1 Poisson processes 

Fj j. for j = 1, ...,n and k = 1,2, When Fj^ "tolls," any ball in bin j 

at height k in either process dies (so that or 1 or 2 balls die). Observe 
that at any time t, we are interested in only a finite (with probability 1) 
number of these death processes [namely, >~2j Xt(j) V Yf(j)]. We have now 
described the coupling of (Xt) and (Yt). The "memoryless" property of the 
exponential lifetime distribution ensures that it is a proper coupling; and 
when the arrival process is Poisson, and the choices are independent and 
uniform, the joint process (Xt,Yt) is Markov. For x,y G Z n , the notation 
x < y means that x(j) < y(j) for each j = 1, . . . , n. 

Lemma 3.1. With the coupling of (Xt) and (Yt) described above, the 
distance \\Xt — Yt\\i is nonincreasing, and given that \\Xq — Yq\\i = r, it is 
stochastically at most the number of survivors at time t of r independent 
balls. Further, if < s <t and X s < Y s , then Xt <Yt. 
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Proof. Consider a jump time to- Let X to - = x and Y to - = y, and let 
X to = x' and Y to = y' . (We assume right-continuity.) Suppose that to is a 
death ("toll") time. If none or two balls die, then 

( 10 ) Ik'-y'lli = ||x-j/||i, 

and if just one ball dies, then 

( n ) \W -y'\\i = \\x-y\\ 1 -l. 

Thus, at any death time, 

(12) \W-y'\\i < \\x-y\h- 

Suppose now that to is an arrival time, and ball b arrives. We want to 
show that (12) holds. If ball b is placed in the same bin in the two processes, 
then (10) holds and, hence, so does (12). Suppose that ball b is placed in 
bin % in the X-process and in bin j in the Y-process, where i ^ j. Then 
ball b gets "paired" in at least one of the processes, and so (12) holds. (By 
"paired" here, we mean that in the other process there is a ball in the same 
bin at the same height. Observe that these balls will stay paired until they 
die together.) For, note first that x(i) < x(j) and y(j) < y(i), and not both 
are equal by the tie-breaking rule. Now suppose that ball b does not get 
paired in either process. Then we must have x{i) > y{i) and y{j) > x(j), 
and so 

x(i) > y(i) > y(j) > x(j) > x(i). 

But then all the values are equal, a contradiction. 

We have now seen that (12) holds at each jump time, and (11) holds if 
a single unpaired ball dies. Thus, \\Xt — is nonincreasing. Further, we 
claim that, for any time < s < t and any positive integer r, given that 
\\X S — Y s \\i = r and any other history up to time s, the probability that 
\\Xt — Yt\\i = r is at most e~ r ^~ s \ The second part of the lemma will follow 
immediately from the claim. 

To see why the claim is true, let S r denote the set of states (x,y) such 
that ||x — y\\i = r. We have seen that \\Xt — Y-t\\i is nonincreasing. For each 
state (x,y) G S r , there are r of the death processes Fjk such that if any of 
them tolls, then the process moves into SV— i- Thus, if {Xq, Yq) £ S r and T = 
inf{t > : (X t , Y t ) £ S r } is the exit time from S r , then Pr (T > t\(X ,Y ) = 
(x,y)) < e~ rt for each (x,y) £ S r and each t > 0; and the claim follows. 

The final comment on monotonicity is straightforward. For consider a 
jump time to as above, and suppose that x < y. If to is a death time, then 
clearly x' < y', so suppose that to is an arrival time. But if the new ball is 
placed in bin i in the X-process and if x{i) = y{i), then the ball is placed in 
bin i also in the Y-process, so x' < y'. □ 
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We may now rapidly prove Theorem 1.1. By the lemma, E(||A^ — Yj||i|(Xo, Yq) = 
(x, y)) is at most the expected number among r = \\x — y\\i balls that survive 
at least to time t, which is equal to re - *. Since ||x — y||i < \x\ + \y\, we have 

e(||x* - y t ||i|x ,y ) < (\x \ + |y |)e-', 

and so 

d w (C(Xt),C(Y t )) < E(\\X t - Y t ||i) < (E[|X |] + An)e~*. 

This completes the proof of Theorem 1.1. 

We now show that the upper bounds on the mixing times arising from 
Theorem 1.1 are of the right order. We may see this by simply considering 
the total number \Xt\ of balls in the system. In equilibrium, \Xt\ has the 
Poisson distribution Po(An), and so 

dTv{£(X t ),Il)>d TY (jC(\X t \),Po(\n)). 

We shall see that if Xq = a.s. and t < ^ Inn — 2 In Inn, then 

(13) d TV (£(|X t |),Po(ArO) = l-o(l); 

and it follows that, for each < £ < 1, we have r(e, 0) = SI (Inn). 

Suppose then that Xq = a.s. and let /x(i) = E[|X t |]. It is easy to check 
that fx(t) = An(l — e~*). If t is 8(lnn), then, by Lemma 5.5 below (with, 
say, b = ln 3 ^ 2 n), 

Pr(|pC t | - fM(t)\ > \\n 1 / 2 \n 2 n)=e-^ A/2n \ 
Also, if Z ~ Po(An), then, by (5), 

Pr(|Z-An| >n 1 / 2 lnn)=e- n ( ln2n ). 
Now if t is ^lnn — 21nlnn, then |n(t) — An| = Xne~ t = An 1 / 2 ln 2 n, and, thus, 

d TV (C(\X t \),Po(Xn)) = 1 - e - n ( ln3/2 «) = 1 - o(l), 

which gives (13) as required (since the left-hand side is a nonincreasing 
function of t). 

4. One choice: proof of Theorem 1.2. Let A > be fixed, as always. Let 
d = l. Let pi = Pi(\) = e~ Ylk>i tt> the probability that a Po(A) random 
variable takes value at least i. Let Xq be in equilibrium. Stationary bin loads 
are independent Poisson random variables, each with mean A. It follows that, 
for any nonnegative integer i, 

(14) Pr (M t >i)< npi 
and 

(15) Pr (M t < i) = (1 - p t+1 ) n < e- npi +\ 



10 



M. J. LUCZAK AND C. MCDIARMID 



We now prove the three parts of the theorem. 

Part (a). Let uj(n) = In Inn. Let m = m(n) be the least positive integer i 
such that npi + \ < l/u(n). By (14), 

Pr (M t > m + 1) < n Pm+1 = o(l), 

so M t < m a.a.s. Also, np m > l/u(n), so np m _i = ^(^j^ ■ ^y) -> oo. 
Hence, by (15), 

Pr (Af t < m - 2) < e"^™" 1 = o(l). 

Thus, Mt is m or m — 1 a.a.s. Also, it is easy to check that m ~ " ■ 
Pari (b). We apply case (b) of Lemma 2.1, with s ~ n~ K ~ 2 , a = m — 4 

and 6 = 1, together with (6) and (15). 

Part (c). Let Z = max 0<Kn K M t . Let e > 0. We show first that 

(16) Pr (Z > (K + 1 +e) Inn/ In Inn) ^0 as n ->• oo. 

To do this, we apply case (b) of Lemma 2.1, with s ~ exp(— Inn/ lnlnn), 
a ~ (K + 1 + e/2) Inn/ lnlnn and 6 ~ lnn/(lnlnn) 2 , together with (6) and (14). 

Now let < e < K, and let k = \{K + 1 — e) Inn/ lnlnn]. We will show 
that 

(17) Pr(Z<fc)^0 asn^oo, 

which will complete the proof of this part and thus of the theorem. Note 
that npk = n~( K ~ £+0 ^>> = o(l). For each time t > 0, let (j>t be the sigma field 
generated by all events until time t. Let C be the event that \Xt\ < n 2 /2 for 
each t 6 [0, n^]. Then C holds a.a.s. by Lemma 2.2. Let n > 2A and let x be 
a load vector such that \x\ < n 2 /2. Given Xq = x, by Theorem 1.1, 

d TY (C(X t ),U)<(Xn + \x\)e~ t 
< n 2 e"* 

if t > t\ = In 2 n + 2 Inn. In particular, by (15), 

Pr (M tl < k - 1\X Q = x) < e- npk + e~ lv? n . 
Since npk = o(l), 

e -np k + e - In 2 n < g-np fc ^ + gg- In 2 

for n sufficiently large, which we now assume. Thus, for % = 0, 1, . . . , 
Pr (M (i+1)tl < - l|fe) < e"^(l + 2e- ln2 «) 
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on the event D{ = (\Xi tl \ < n 2 /2) A (Mi tl < k — 1). Hence, if we denote 
[n K / i i J by io, we have 

Pr {{Z < k - 1) A C) < Pr ^ /\ 

i -l / i \ 

= Pr (D )H Pr A+l A A" 

j=0 V j=o / 

< ( e -^(i + 2 e - ln2ri )) io 

< (1 + o(l)) • exp(-(n^/ti - lJn-C^"^ 1 ))) 
= exp(-n £+ °W) ^0 

as n — > oo. Above we used the observation that 

(1 + 2e- ln2n ) io < exp(i ■ 2 e - lv?n ) = 1 + o(l). 

5. Concentration. We have seen that our balls-and-bins model exhibits 
rapid mixing. In many Markov models rapid mixing goes along with tight 
concentration of measure. This is indeed the case here, as demonstrated 
by the following lemma, which is crucial to our analysis. See [5] for large 
deviations bounds for a related discrete-time balls-and-bins model. 

Let n be a positive integer, and let £1 be the corresponding set of load 
vectors, that is, the set of nonnegative vectors in Z n . A real- valued function 
/ on is called Lipschitz (with Lipschitz constant 1) if 

I/O) - f(y)\ < \\x - y\\i- 

Lemma 5.1. There is a constant tlq such that, for all n > uq, the n-bin 
system has the following property. Let the load vector Y have the equilibrium 
distribution, and let f be a Lipschitz function on 0. Then, for each u > 
nV2in3/2 n> 

Pr(|/(F)-E[/(F)]|>u)<e-( u2 /") 1/3 . 

As stated in the Introduction, our primary interest is in the maximum 
load of a bin. We may deduce easily from the last lemma the following result 
which we shall use several times. 

Lemma 5.2. Consider the n-bin system in equilibrium. For each positive 
integer i, let L(i) be the random number of bins with at least i balls, at say 
timet = 0, and let l(i) =E[L(i)]. Then 

sup Pr (\L(i) - > n 1/2 ln 3/2 n) = 0{n~ l ); 

i 
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for any constant c> 0, 

Pr fsup|L(i) -l{i)\ ^cn 1 / 2 In 3 rij = e - n(ln2n) ; 

and /or each integer r > 2, 

sup{|E[L(i) r ] - l(i) r \} = 0{n r ~ 1 ln 3 n). 

i 

PROOF. Note that 

Pr (L( \2\n\ ) >0) < Pr (Po(An) > 2An) = e~ n{n \ 

since the total number of balls is Po(An). Since always L(i) < n, this shows 
that we may restrict attention to i < 2Xn. The first two parts of the lemma 
now follow directly from Lemma 5.1 (note that no is a constant, and does 
not depend on /). To prove the third part, set u = (r + l) 3 ^ 2 n 1 / 2 In 3 / 2 n, and 
note that, by Lemma 5.1, 

Pr (\L(i) - >u)< e -(r+i)i™ = 

for n > no- Hence, for each positive integer k <r, 

E[\L(i)-l( y i)\ k }<u k + n k Pr(\L( y i)-l(i)\>u)<u k + o(l). 

The result now follows from 

o < E[L(i) r ] - w =ib(l) mm - m k w) 



k=2 



k 

<j2( r k )n\m-Ki 



k=2 

= 0(n r - 1 ln 3 n). □ 

The next lemma extends the second part of the last lemma, and shows 
that in equilibrium the number Lt(i) of bins with load at least i at time t 
is unlikely to move far from its mean value We show that all the values 
Lt(i) are likely to stay close to l(i) throughout a polynomial length time 
interval [0, r]. 

Lemma 5.3. Let K > be an arbitrary constant, and let r = n . Let 
Xq be in equilibrium. Then 



Pr 



sup sup \L t (i) — > n 1 / 2 In 3 n 
te[o,r] i 



e -C(ln 2 n)_ 
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Proof. By Lemma 5.2, there exists 7 > such that for all n sufficiently 
large, for each time t > 0, 



The rest of this section is devoted to proving Lemma 5.1. The plan of 
the proof is as follows. Consider a loads process (Xt), where Xq = xq for 
a suitable load vector xq. (We are most interested in the case xq = 0.) We 
shall prove concentration for Xt, and later deduce concentration for the 
equilibrium load vector Y. 

Note first that the equilibrium load of a bin is stochastically at most 
Po(Xd). For we can couple the load of a single bin with a process where the 
arrival rate is always exactly Xd and the death rate exactly 1, so that the 
number of balls in the former is no more than in the latter at all times; and 
for the latter process, the equilibrium number of balls is Po(Ad). 

It will be convenient to limit the maximum load of a bin. Let 6 = b(n) be 
an integer at least, say, 4 Inn/ In Inn — we shall specify a value for b later. 
Assume that maxjXo(j) < 6/3. Let At be the event that M s < b for all 
< s < t. If temporarily M s denotes the maximum load of a process in 
equilibrium, then, by the time "monotonicity" part of Lemma 3.1, we have 

Pr (A t ) < Pr (M s > 26/3 for some s G [0,r]). 

Hence, by (9) in Lemma 2.1(b) and by (6), 

Pr (A t ) <(t + l)(2n) Pr (Po(Ad) > 6/3) 

= exp(ln(t + 1) +lnn- \bhxb + 0(b)). 

It follows that, for n sufficiently large, for each time t < e b , say, 

(18) Pr (A t ) < e- blnb/13 . 

In fact, we shall ultimately specify values for t and 6 so that t = 0(61n6). 

Since loads are rarely large, we can approximate the loads process (Xt) 
by using only a few of the death processes Fj^, namely, those with k < 6, 
which we call the "low" death processes. In fact, we shall model both the 
original process and the approximating process, by replacing these low death 
processes by a combined low death Poisson process with rate nb, and a 
"reaper" process (we omit the "grim"), which at each "toll" of the rate n6 
Poisson process selects uniformly at random a pair (j, k) where j G {1, . . . , n} 
and k G {1, . . . , b}, and behaves as if the process Fj^ had "rung." Let X t be 
the approximating process, which uses only the low death processes. Observe 




We now let s = n l l 2 and 6 = 2An 

□ 



1//2 , and use Lemma 2.1(a), inequality (9). 
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that on At we have Xt = Xt. Since Pr {At) is so small, it will suffice for us 
to prove concentration for Xt- 

Let z and z be positive integers. Let t = {ti, . . . ,t z ) be z arrival times 
(not ordered) and let d = {d±, . . . , d z ) be corresponding choices of d bins. Let 
t = {t±, . . . ,t z ) be z low death times (not ordered) and let d = {d\, . . . , d z ) 
be corresponding reaper choices [of pairs (j,k), where j £ {l,...,n} and 
k £ {1, ...,&}]. Assume that all these times are distinct. Given any ini- 
tial load vector x, our approximating simulation generates a load vector 
st{x, t, d, t, d) for each time t > 0. 

The following deterministic lemma is analogous to the first part of Lemma 3.1, 
when the arrivals, choices, death times and reaper choices processes are all 
deterministic, and may be proved in a similar way. 

Lemma 5.4. Suppose that we are given two initial load vectors xq and yo, 
together with any sequence of arrival times t and corresponding bin choices 
d, and departure times t and corresponding reaper choices d, where all these 
times are distinct. Then the distance ||st(xo,t,d,t,d) — st{yo,t,d,t,d)\\i is 
nonincreasing in t, and so, in particular, for each t>0, 

||si(zo,t,d,t,d) -s t (y ,t,d,t,d)||i < ||x -yo||i- 

Similarly, \\st{xQ,t,d,t,d) — st{yo,t,d,t,d)\\ oa is nonincreasing in t [recall 
that \\z\\oo = maxj |z(j)|]. 

Let us now sketch the plan of the rest of the proof. Let /j,{t) = E[f{Xt)] 
and p,{t) = E[f{Xt)\. Let Zt be the number of arrivals in [0, t], so that 
Zf ~ Po(Ant). Let Z% be the number of low death times in [0,t], so that 
Zt ~ ~Po{bnt). We shall condition on Zt = z and Zt = z. Let fi(t,z,z) = 
E[f{Xt)\Zt = z, Z t = z]. We shall use Lemma 5.4 and the bounded differ- 
ences method to upper bound Pr(\f(X t ) — jj,{t,z,z)\ > u\Z t = z,Z t = z), 
see (20) below. 

Next we remove the conditioning on Zt and Zt- To do this, we choose 
suitable "widths" w and w, then use the fact that both Pr {\Zt — Xnt\ > w) 
and Pr {\Zt — bnt\ > w) are small, and for z and z such that \z — Xnt\ < w 
and \z — bnt\ < w, the difference \ ji{t, z, z) — fi{t)\ is at most about 2{w + w), 
see (23) below. We thus find that Pr(|/(X t ) - fi(t)\ > 3{w + w)) is small. 
But since Xt = Xt on At, and At is very likely to occur, this last result 
yields concentration for f{Xt) around its mean. The part of the proof up to 
here is contained in Lemma 5.5 below. Finally, we use the coupling lemma 
(Lemma 3.1) to relate the distribution of Xt (with Xq = 0) to the equilibrium 
distribution. 

Let us start on the details. We shall use the following lemma with xq = 0. 
(It is convenient elsewhere to have the more general form.) 
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Lemma 5.5. There are constants uq and c > such that the following 
holds. Let n > no and b > 41nn/lnlnn be integers, and let f be a Lipschitz 
function on Q. Let also xq G Q be such that maxj^oO) < b/3, and assume 
that the process (X t ) satisfies Xq = xo a.s. Then for all times < t < e b and 
all u> 1, 

(19) Pr(\f(X t )-n t \>u)<ne- cu2/ ( nbt) + e~ cnt + e- 



cblnb 



Proof. Note first that we may assume without loss of generality that 
f(xo) = 0, and so < Z t + Z t , since we could replace f{x) by f{x) — 

f(xo). Let z, z be positive integers, and condition on Z t = z, Z t = z. Then X t 
depends on 2(z + z) independent random variables T\ , . . . , T z , D\, . . . , D z , 
Ti,...,Tj, and D\,...,D Z . Indeed, we may write X t as Sf(xo, T, D, T, D), 
where T = (T 1} . . . , T z ), D = {D x , . . . , D z ), f = (T x , . . . , f s ), and D = (D u . . . , D 2 ). 
This property relies on the well-known fact that, conditional on the number 
of events of a Poisson process during [0,t], the unordered event times are a 
sample of i.i.d. random variables uniform on [0,t]. Write 

#(t,d,t,d) = /(s t (x ,t,d,t,d)). 

We prove that, conditional on Z t = z and Z t = z, the random variable f(X t ) 
is highly concentrated, by showing that g satisfies a "bounded differences" 
condition. Suppose first that we alter a single co-ordinate value dj. Then 
the value of g can change by at most 2, by Lemma 5.4 starting at time 
tj with || xt. — ytjlli < 2; the same holds if we alter a single co-ordinate 
value dj . Similarly, if we change a co-ordinate value tj or tj , the value of g 
can change by at most 2: we may see this by applying Lemma 5.4 once at 
the earlier time and once at the later time. Thus, changing any one of the 
2(z + z) co-ordinates can change the value of g by at most 2. Now we use 
the independent bounded differences inequality, see, for instance, [14]. We 
find that, for each u > 0, 



Pr (| 5 (T,D,T,D) - E[ 5 (T,D,T,D)]| > u) < 2exp 
In other words, we have proved that, for any u > 0, 
(20) Pr (\f(X t ) - fi{t, z,z)\> u\Z t = z,Z t = z)<2 exp 



u 2 



4(z + z) 



u 2 

' A{z + z) 

Next we will remove the conditioning on Zt and Zt- We will choose suitable 
"widths" w = w(n) and w = w(n), where < w < Xnt and < w < bnt. Let 
/ denote the interval of integer values z such that \z — Xnt\ < w, and let / 
denote the interval of integer values z such that \z — bnt\ < w. Recall that 
we shall ensure that with high probability Zt £ L and Zt E I, and for each 
z £ L and z £ L, the difference \ fi(t, z,z) — £i(t)\ is not too large. 
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Since Zt ~ Po(Ani) and Z% ~ Po(bnt), by (5), 

w 2 \ 



' 3Xnt J 



(21) Pr(Z t <£l)= Pr (\Z t -Xnt\ > w) <2exp( 
and 

(22) Pr (Z t i I) = Pr ( | Z t - but | > w) < 2 exp (- . 



We shall choose w and w to satisfy w > 2(Xnt hm) 1//2 and w > 2(bnt Inn) 1 / 2 . 
Then, by (21), (22), (5) and (7), provided that b satisfies 6 = o(n 1 / 3 ), 

V[Za {Zt0 vz t 0)] < nza Zt >Xnt +w ] + Ant Pr (Z t $ 1) = o(l) 

and 

V[Zt\ ZtiIv z t fi)] < nZtl Zt>bnt+iD ] + bntPv {Z t i I) = o(l). 
Hence, since \f(X t )\ <Z t + Z t , 

B[\f(X t )\ l (ZttIv2tti) ] = o(l). 



But 



A(t)= £ A(* ) 2,5)Pr(Z t = i! ,Z t = 5) + E[/(X t )l (Zij y v ^]. 



Hence, 



£b(t) < max z,z)} + o(l), 
zei,Sei 

and, using also (21) and (22), 

p,(t) > min _{jl(t,z,z)} + o(l). 
zel,Sei 

By Lemma 5.4, for each z, z, 

+ -A(t,z,z)| < l 

and 

|A(i,z,z + l) - fi(t,z,z)\ < 1. 
It follows, using the bounds above on p,(t), that, for each z 6 / and z E I, 
(23) |£(t, z, 5) - < 2(w + w) + o(l). 
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Now by (20), (21), (22) and (23), 

Pr(|/(* t )-£(t)|> (3 + o(l))(w + w)) 

< Pr(\f(Xt)-m\>(3 + o(l))(w + w)\Z t = z,Z t = 2) 

x Pr (Z t = z,Z t = z) 
+ Pr (Z t <£I) + Pr(Z t I) 

< Pr(\f(X t )-fi(t,z,~z)\>(l + o(l))(w + w)\Z t = z,Z t = z) 
zei,z£i 

x Pr(Z t = z,Z t = z) 
+ Pr (\Z t -Xnt\ >w) + Pr (\Z t -bnt\ > w) 



(l + o( l))( w + ^)2 X / w 2 X / W 2 \ 

4(Ant + bnt + to + w) J 6XP V 3Ani/ 6XP V 3bnt J 
( (l + o(l))(w + w) 2 \ ( w 2 \ ( w 2 \ 

since b(n) — > oo as n —> oo. Let u satisfy 

6(n6ilnn) 1/2 < u < 3V\bnt. 

Let w = u/3 and w = W\J A/6. Observe that, for n sufficiently large, the 
bounds required above on w and w hold, and u = (3 + o(l))(u> + Thus, 

Pr(|/(X t ) - £(t)| > «) < 2e -( 1 +°( 1 ))« 2 /(45nM) +4e -« 2 /(27nM) 

< e -« 2 /(46nW) 

for n sufficiently large. But if u < 6(nbt Inn) 1 / 2 , then e -« 2 /( 46nfe *) > j^,- 1 . 
Thus, as long as u < 3\^Xbnt, we have 

Pr (\f(X t ) - j±(t)\ >u)< ne-^K^nbt)^ 

Now we move from Xt to Xt. Note that in [0, t] there are Zt arrivals and 
at most |Xo| + Z t departures, and so \f(X t ) — f(X t )\ < 2(\Xq\ + 2Z t ). Thus, 
since also Xt = Xt on At , 

\m-Kt)\ = |E[(/(X t ) - f(X t ))l^ t }\ < 2E[(\X \ + 2Z t )l 3 J. 

But |X | <n6/3 and E[Z t l 3 J < 2Ant Pr (A t ) + E[Z t lz t>2 An*]- Hence, 

|£(t) " < (2"6/3 + 8Xnt) Pr (A t ) + 4E[Z t lz t > 2 Ani] = o(l), 
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by (18) and (7). Thus, 

Pr (\f(X t ) - M (t)| > u) < Pr (\f(X t ) - fc(t)\ >u + o(l)) + Pr (A t ) 

< ne -(«+o(l)) 2 /(46n.W) + e -61n6/13 

by (18) (since we assume that t < e b ). The lemma now follows, by replacing 



We shall use Lemma 5.5 here with Xq = to complete the proof of 
Lemma 5.1. As we saw before, we may assume that /(0) = 0, and, hence, 
always |/(a;)| < \x\. It remains to relate the distribution of Xt with Xq = 
to the equilibrium distribution, and to choose values for b and t. By The- 
orem 1.1, if Y has the equilibrium distribution, then dyy (C(X t ) , £(Y)) < 
Ane - *. Hence, for all n sufficiently large, b > 41nn/lnlnre and u > 1, 



Let u > 2(nln 3 n/clnlnn) 1 / 2 . Let t= (u 2 c In In n/n) 1 / 3 and b = [i/lnlnnj. 
Then t > ^lnn. Also, ln6> (1 + o(l)) In Inn, so 61n6> (1 + o(l))t = 
Further, cu 2 /(nbt) = 0(t). It now follows from (24) that 



This completes the proof of Lemma 5.1. 

6. Balance equations. In this section we suppose throughout that the 
system is in equilibrium. We present the balance equation (26), and de- 
duce Lemma 6.1, which we shall need in Section 8, concerning the expected 
proportion of bins with at least i balls. 

Let d > 2 be a fixed integer. Consider a positive integer n, and the corre- 
sponding set of load vectors. For x £ £1 and a nonnegative integer k, let 
u(k,x) be the proportion of bins j with load x(j) at least k. Thus, always 
u(0,x) = 1. Let X have the equilibrium distribution over Q, and let u(k) 
denote E[u(/c,A A )] (which depends on n). [Thus, u(k) =l(k)/n, where l(k) 
was defined earlier as the expected number of bins with load at least k.] 



u by min{n, 3 



VXbnt}. □ 



(24) 



Pr(|/(r)- M (i)|>u) 

< d T y(C(X t ),C(Y)) + Pr - M (t)| > u) 

< \ne~ l + ne~ cu2/{ntb) + e" cnt + e" c61nb . 
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Lemma 6.1. (a) There is a constant c such that, for n sufficiently large, 
if j > In Inn/ In d + c, then u(j) <n~ 1 In 3 n. 

(b) For any r] > 0, there is a constant c such that, for n sufficiently large, 
if J '< In Inn/ In d — c, then u(j) > n - ^ . 

The rest of this section is devoted to proving this lemma. First we present 
the balance equations. 

It is easy to check (see [21]) that, if / is the bounded function f(x) = 
u(k,x), then the generator operator G of the Markov process satisfies 

Gf(x) = X(u(k - l,x) d - u(k,x) d ) - k(u(k,x) - u(k + l,x)) 

[cf. with equation (1) earlier]. To see this, note that u(k,x) — u(k + l,x) is 
the proportion of bins with load exactly k, and u(k — l,x) d — u(k,x) d is the 
probability that the minimum load of the d attempts is exactly k — 1 . Since 
X is in equilibrium, E[Gf(X)] = 0. Hence, 

(25) \(E[u{k - 1, X) d ] - E[u(k, X) d }) - k{u(k) - u{k + 1)) = 0. 



Now 



and so 



n " \ z / n 

k>l j=l 



Hence, ku(k) — > as k — > oo. Also, E[u(fc, X) d ] < u(k). It follows on sum- 
ming (25), for k>i, that, for each i = 1, 2 . . . , we have 

(26) \E[u(i-l,X) d ]-iu(i) - ^ u(k) = 0. 

k>i+l 

(This is the result that ~E[Gf(X)] = 0, where f{x) is the number of balls 
of "height" at least i, i.e., f(x) = X)j=i( x (j) — * + Du ^ since / is not 
bounded, we cannot assert the result directly.) Equation (26) is the key fact 
in our analysis. Observe that, by (26), for each positive integer i, 

(27) u(i)<-E[u{i-l,X) d }. 

i 

We are now ready to prove the lemma, part (b) first. Let a = [2A] — 1. 
We shall show that u(a) is at least a positive constant, and the u(i) do not 
decrease too quickly for i>a. 

Note first that, since E[u(i - l,X) d ] > u(i - l) d , by (26), we have 

(28) \u(i - l) d - iu{i) - Yj u ( k ) ^ °- 

k>i+l 
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Also, since < u(i — 1,X) < 1, we have E[u(i — 1,X) ] < u(i — 1) and so 
by (27), for each i = 1,2, . . . , we have u(i) < \u(i — l)/i. Thus, for i > a, 
we have tt(i + 1) < \u(i)/(i + 1) < u(i)/2. Hence, if k>i> a, then iz(fc) < 
2~^ k ~'^u(i); and so 

ii(A;) < u(i) for i> a. 

k>i+l 

It now follows from (28) that, for i > a, we have Xu(i — l) d — (i + l)u(i) < 0; 
and, thus, 

/ s A«(t - l) d 
29) u(i) > — i >— for j > a. 

Inequality (29) will show that the ti(i) do not decrease too quickly for i>a. 

Now consider small values of i. Let i S {1, . . . , a}. Since u(i) > u(/c) for 
k>i, we have (a — i)u(i) — X)fc=i+i u (k) > 0. Hence, by (28), 

0>A-u(i-l) d -in(i)- ^ tt(jfe) 

fc>i+l 

> \u(i - l) d - au{i) - u ( k ) 

k>a+l 

>\u(i-l) d -(a + l)u(i). 

Thus, we have 

u(i) > - u(i - l) d for i = l,..., a. 

The last inequality shows that there is a constant 5± > (depending only 
on A and d) such that always u(a) > S±. But by (29) and induction on i, for 
each z = 1,2,..., 



u ( a + /_,.■, ~ , T^TZ , — TTd2 ~ , n ^-i u ( a Y 



(a + i + l)(a + i) d (a + i- l) d2 • • • (a + 2) d 
To upper bound the denominator, note that 

ln((a + 1 + l)( a + i) d (a + i - if ■ ■ ■ (a + 2)* -1 ) 

i 

= cf ^ d" fc ln(a + fc + 1) < c 2 cf 
fc=i 

for some constant C2, and so the denominator is at most e C2<i \ Let 52 > be 
the constant \e~ C2 5\. Then 

1 



u(i) > u(a + i)> 5 2 = exp -cf In 

02 
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for each i = 1,2, ... . Let the constant C3 be such that d~ C:i In j- < i]. Hence, 
if i < lnlnn/lnd — C3, then 

u(i) > exp ( — (In n)d~ C[1 In — 
V o 2 

> exp(— rylnn) = n _r? . 

This completes the proof of part (b) of the lemma. 

We now prove part (a) of the lemma. By Lemma 5.2, there exists a con- 
stant ci > such that, for all positive integers i and n, 

(30) u(i) <-(u(i- l) d + an" 1 In 3 n). 

1 

Let i* = i* (n) be the smallest positive integer i such that u(i — l) d < c\n~ l In 3 n, 

that is, u(i - 1) < c] /d n~ 1 l d {\anfl d . We may assume that n is sufficiently 

large that ciln 3 n > 1, and so the quantity c^ d n _1 / d (lnn) 3 / d in the last 
bound is > 1/n. Note that, by (30), 

2A 

< — cin -1 In 3 n = o(n _1 In 3 n), 

since i*(n) — > 00 as n — ► 00 by part (b). 
We want an upper bound on i*. By (30), 

\ 

(31) < — u(i - l) d 

i 

for each i = 1, — 1. Let iq be the constant [2eA]. We check that 

i* < In In n/ In d + «o + 2. Since i*(n) — > 00 as n — > 00, we may assume that 
i < i* - 1. By (31), u(i ) < f «(io - l) d < f < e" 1 . Also by (31), for i = 
z'o + 1, •••,«* — 1, we have < n(i — l) rf , and it follows that u(i) < e~ d% '° for 
each i = iq, ... ,i* — 1. But e _d ' 10 < 1/n when cf - * > Inn; that is, when i > 
In Inn/ In ci + io- Thus, if > In Inn/ In d + io + 2, then n(i* — 2) < 1/n, con- 
tradicting the choice of i* . This completes the proof of part (a) of Lemma 6.1, 
and thus of the whole lemma. 



7. Random walks with drift. In this section we consider a generalized 
random walk on the integers, which takes steps of 0,±1 but with probabil- 
ities that can depend on the history of the process, and where there is a 
"drift." We shall use the following version of the Bernstein inequality — see 
Theorem 2.7 in [14]. 

Lemma 7.1. Let b>0, and let the random variables Z\,...,Z n be inde- 
pendent, with Zk — ~Ei[Zk\ > —b for each k. Let S n = J2 k Z^, and let S n have 



22 



M. J. LUCZAK AND C. MCDIARMID 



expected value fi and variance V (assumed finite). Then for any z > 0, 

(32) Pr (S n <a-z)< exp ( • 

v ' v n_p ; - P V 2V + (2/2,)bz) 

(The term |6z should be thought of as an error term.) The next lemma 
concerns hitting times for a generalized random walk with "drift." 

Lemma 7.2. Let fa C fa C • • • C m &e a filtration, and let Y\, Y2, . . . , Y m 
be random variables taking values in { — 1,0,1} such that each Y{ is fa- 
measurable. Let Eq, Ei, . . . , E m _i be events, where Ei G fa for each i, and 
let E = /\ i E i . Let R t = R + Ya=i Yi- Let0<p< 1/3, let r and r x be in- 
tegers such that ro < n, and let pm > 2(ri — ro). Assume that, for each 
i = 1, . . . , m, 

Pr {Yi = l|0i_i) > 2p on E^ x A < n) 

and 

Vr(Y i = -l\fa_ 1 ) <p on Ei-i A (Ri-i < r\). 

Then 

Pr (E A (Rt < n Vt e {1, . . . ,m})|_R = r ) < exp 

Proof. Let us first prove the lemma assuming that the above inequal- 
ities on Pr(Y; = l|</>j_i) and Pr(Y; = — l|(/)j_i) hold a.s.; that is, ignoring 
the events -Ei_i A (Ri-\ < ri). We shall then see easily how to incorporate 
these events. 

We can couple the Yi with i.i.d. random variables Zi taking values in 
{-1,0, 1}, such that Pr (Z { = 1) = 2p, Pr (Z { = -1) =p and Pr (Z { < Y) = 
1 for each i. The variables Z\,Zi, ■ ■ ■ are independent; E[Zj] =p, Var[Zj] < 
3p, and Z{ — E[Zj] > — 1 — p > —4/3 for each z. Let St = Yl\=i %i> let L l t = 
E(St) =pt, and note that Var(S' t ) < 3tp. Hence, by Bernstein's inequality, 
Lemma 7.1, for each y > 0, 

Pr (S t <fH~y)< exp ( ~tt~~ — ) • 
V Qpt + yJ 

Note that /i m =pm. Thus, if a = r\ — ro, 

Pr{R t <n Vte{l,...,m}\R = r ) 
< Pr (S m < a) 
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H-tO-^) 1 ) 
- exp (r^J' 

since a/pm < 1/2. 

Now let us return to the full lemma as stated, with the events E^. For each 
i = 0, 1, . . . , m — 1, let Fi = Ei A (Ri < ri); and for each i = 1, . . . ,m, let %, = 
Yi ■ + lp- 1 • Let Rq = Rq and for t = 1, . . . , m, let = i?o + Si=i ^i- 
Then Pr (Yi = l\fa-i) > 2p, since, by assumption, it is at least 2p on i^—i, 
and it equals 1 on i^-i. Similarly, Pr (1^ = — l\fa-i) <p. Hence, by what 
we have just proved applied to the Yi, 

Pr (E A (Rt < T\ yte{l,...,m})\Ro = r ) 

= Pr(EA{R t < n Vte {1, ...,m})\R = r ) 
< Pr (R t < n Vte{l,...,m}\R = r ) 
( pm\ 

as required. □ 

The next lemma shows that if we try to cross an interval against the drift, 
then we will rarely succeed. 

Lemma 7.3. Let a be a positive integer. Let p and q be reals with q > 
p > and p + q < 1. Let fa C fa C fa C • • • 6e a filtration, and let Y±, Y%, . . . 
be random variables taking values in { — 1,0,1} such that each Yi is fa- 
measurable. Let Eq, Ei, . . . be events where each Ei £ fa, and let E = Ai-^i- 
Let Rq = and let R^ = J2i=i Yi for k = 1, 2, . . . . Assume that, for each 
i = l,2,..., 

Pr {Yi = l|^_i) < p on Ei_ x A (0 < R^ < a - 1) 

and 

Pr = -l\fa-!) >q on E^ x A (0 < R^ x < a - 1). 

Lei 

r = inf{fc>l:i2A.€{-l,a}}. 

T/ien 



Pr(EA(i? T = a))<(p/g) a . 
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Proof. As with the previous lemma, let us first prove this lemma as- 
suming that the given inequalities on Pr (Yj = and Pr (Yj = — 
hold a.s. We can couple the Yj with i.i.d. random variables Yj taking values 
in {0, ±1} such that Pr (Yj = l)=p, Pr (Yj = -l)=q and Pr (Yj < Yj) = 1. 
Let Rq = 0, let R k = Y$=i Yj for A; = 1, 2, . . . , and let 

f = ml{k>l:R k G {-l,a}}. 

Then from standard properties of a simple random walk, 

Pr (i? T = a) < Pr (i? f = a) < (p/g) a . 

Now let us incorporate the events Ei, and consider the full lemma as 
stated. For each 2 = 0,1,..., let Fi = Ei A (0 < R4-1 < a — 1); and for each 
2 = 1,2,..., let Yj = Yjlp^j — lp. . Let iZfc and T be defined in the obvious 

way. Then Pr (Yj = l|0j_i) < p, since, by assumption, it is at most p on 
Fi-i, and it equals on Fi-\. Similarly, Pr(Yj = — 1 |0z — l) > 9- Hence, by 
what we have just proved applied to the Yj, 

Pr(£A (R T = a)) < Pr(R f = a) < {p/q) a . □ 

8. Proof of Theorem 1.3. We have assembled all the preliminary results 
we need. In this section we at last prove Theorem 1.3, and inequalities (3) 
and (4) that follow it. We assume throughout that the process is in equilib- 
rium. 

Let d > 2 be a fixed integer. Consider the ra-bin system. Recall that u(k) 
is the expected proportion of bins with load at least k. Define j* = j*(n) to 
be the least positive integer i such that u(i) < n" 1 / 2 ^ 3 n. By Lemma 6.1, 

j*(n) =lnlnn/ln<i + 0(l). 

We shall show that, 

(33) for d = 2, we have M = j* or j* + 1 a. a.s. 
and 

(34) for d > 3, we have M = j* — 1 or j* a. a.s. 

This will complete the proof of the first part of Theorem 1.3. 

For each time t and each i = 0, 1, . . . , let the random variable Zt(i) be the 
number of new balls arriving during [0, t] which have height at least i on 
arrival, that is, which are placed in a bin already holding at least i — 1 balls. 

Let Jo = and enumerate the arrival times after time as J\ , J2 , We 

shall define a "horizon" time to of the order of Inn, and let iV = [~2A?iiol • 
For each time t, let At be the event 

{An/2 < \X S \ < 2Xn Vs G [0,<]}. 

Then by Lemma 2.2, the event At holds with probability 1 — e~^ n \ 
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8.1. The case d > 3. We consider first the case when d > 3, which is 
easier than when d = 2. 

Let K > be a (large) constant and let to = (K + 4) Inn. Since — 1) > 
n 1 / 2 In 3 n, the concentration result Lemma 5.3 shows that Pr (M < j* — 1) = 
e -n(in 2 n)_ i n particular, M > j* - 1 a.a.s., which is "half" of (34). Also, (8) 
in Lemma 2.1 above [with s = n~( K+A \ a = j* — 3 and 6 = 1] shows that 

(35) min{Mt : < t < n K } >j* -2 a.a.s. 

This result establishes a finer form of the lower bound half of (2). In fact, 
this half of (2) will follow from (3) which we prove later, so (35) is not needed 
for our proofs. 

Next we shall show that M < j* a.a.s., which is the other half of (34). 
For k = 0, 1, . . . , let be the event that at time there are no more 
than 2n 1 / 2 ln 3 n bins with at least j* balls. Then Pr(E£) = e~ Q(ln2?l) by 
Lemma 5.2, since < n 1 / 2 ln 3 n. Consider the ball which arrives at time 
Jk- on Ek-i, it has probability at most p\ = (2?i~ 1 / 2 In 3 n) d of falling into a 
bin with at least j* balls. Note that 

(36) Pr(J N+1 <t ) < Pr[Po(\nt )>2\nt ]=e-^ nlnn \ 
Also, for each positive integer r, 

Pr (B(N, P1 ) >r)< (N Pl ) r = 0{{n^ d ' 2 -^ (lnn) M+1 ) r ). 

(Here we are using B to denote a binomial random variable.) Hence, for 
each positive integer r, using Lemma 5.3, 

Pr[Z t0 (f + l)>r] 

<-Pr(B(N, Pl )>r)+Pr(\f E^\ + Pr ( J N+1 < t ) 

\k=0 J 
= 0((n~W 2 ~ 1 \lnn) 3d+1 ) r ). 

Also, the probability that some "initial" ball survives to time to is at most 
Ane - * , as we saw earlier. Hence, for each positive integer r, 

Pr [M > j* + r] < Pr [Z to (j* + 1) > r] + Ane - * . 

In particular, Pr (M > j* + 1) = o(l), which together with the earlier result 
that M > j* — 1 a.a.s., completes the proof of (34). Further, 

Pr [M > j* + 2K + 5] = o{n~ K ~ 2 ). 

Now (9) in Lemma 2.1 with r = n K and s = n~ 2 , together with (35), lets us 
complete the proof of (2). 
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8.2. The case d = 2. The case d = 2 needs a little more effort, and uses 
the "drift" results from the last section. Again, let K > be a (large) con- 
stant, but now let to = (2K + 8) In n. We first show that M > j* , by showing 
that, in fact, 

(37) L t0 (j*)>\n 3 n a.a.s. 

Let Jq = 0, and enumerate all jump times after time (not just the arrival 
times) as J[, Note that J' n < to a.a.s., since 

Pr (J' n > t ) < Pr ( J n > t ) = Pr (Po(Ani ) < n) = e - n (" ln ") 
by (5). For k = 0, 1, . . . , let E k be the event Aji A (Ljr (j* — 1) > in^ln 3 ™). 
Let E = Afc=o-Efc- We saw earlier that Pr(^ ) = o(l). By Lemma 5.2, as 
before, we have Pr (Lj>(j* - 1) < |n 1 / 2 ln 3 n) = e- n ^ 2n \ Thus, 

Pr (E) < Pr (AT ) + Pr > t ) + ne^^ 2 ™) = o(l). 

For = 0, 1, . . . , let Rk = Lj>(j*) and for k = 1,2, ... , let Y^ = Rk — Rk-i, 
so that 

3=1 

Let p 2 = ln 6 n/(24?i), and let n = [21n 3 nJ . On E k ^ x A (R k -i < n), 

Pr(n = l|0 J /_ i )>2p 2 

and 

Pr(y fc = -l|6* )<p2, 

fe— 1 

for n sufficiently large. [Here we use <j>t to denote the <r-field generated by 
(X s :0 < s < t).] Also, then np2 > 2r\. Hence, by Lemma 7.2, for each integer 
ro with < tq < r\ , 

Pr (E A (12* < n V k G {1, . . . , n})|12o = r ) < e" P2n/28 . 

Since Pr(£J) = o(l), it follows that a.a.s. Rk > ri for some A; £ {1, . . . 

(If Ro = r\, then we may replace ri by = n + 1 above: if Rq > ri + 1, then 

i2i > n a.s.) Thus, a.a.s. Ljt(j*) > [21n 3 riJ for some fc G {1, . . . ,n}. Finally, 

since J' n < to a.a.s. as we saw above, we find that a.a.s. L t (j*) > [21n 3 nJ for 
some t G [0, to]- 

In order to complete the proof of (37), it suffices to show that a.a.s. 
there will be no "excursions" that cross downwards from [2 In 3 nj to at most 
ln 3 n. Let B be the event that there is such a crossing. The only possible 
start times for such a crossing are departure times during [0, to]- Recall that 
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N = [2Antol- Now \Xq\ < N a.a.s. and we saw in (36) that a.a.s. there are 
at most N arrivals in [0, to]- Hence, if C denotes the event that there are 
more than 2N departures during [0, io]> then Pr (C) = o(l). 

We may use Lemma 7.3 (suitably translated and reversed) to upper bound 
the probability that any given excursion leads to a crossing. Let a = [In 3 nj . 
Let p = 2p2 and let q = P2- We apply Lemma 7.3 with a, p, q and Ek as 
above. We obtain 

Pr (B) < Pr (C) + N2~ a + Pr ( f\ = o(l). 

\fe=0 / 

Thus, we have established (37), and, hence, proved that M > 7 cl.cl.S. 

We now consider upper bounds on M. We shall show that M < j* + 1 
a.a.s., by showing that L to (j* + 2) = a.a.s. For k = 0, 1, ... , let F}. be the 
event that at the arrival time Jk there are no more than 2n 1//2 ln 3 n bins 
with at least j* balls. Since < n 1 ' 2 ln 3 n, Lemma 5.1 yields Pr(i^) = 
e -f2(ln n)_ Consider the ball which arrives at time Jk- on it has proba- 
bility at most P3 = 41n 6 ra/n of falling into a bin with at least j* balls. Thus, 
for each positive integer r, 

Pr (Z t0 (j* + 1) > r) < Pr (fl(JV,p 3 ) > + Pr ( V ^1 + Pr ( J n+i < to). 

\fc=0 / 

Also, the probability that some "initial" ball survives to time to/2 is at 
most Ane~*°/ 2 . Hence, there is a constant c such that, with probability 
1 - 0(n~ K ~ 3 ), we have L t (j* + 1) < cln 7 n uniformly for all t G [t /2,t ]. 
Thus, this also holds over [0, to] • 

For h = 0, 1, ... , let F' k be the event that at time there are no more 
than cln n bins with at least j* + 1 balls. On FL-y, the ball arriving at time 
Jk has probability at most p4 = c 2 In 14 n n~ 2 of falling into a bin with at 
least j* + 1 balls. Then for each positive integer r, 

Pr(Z t0 (j* +2) > r) < Pr(B(N,p A ) > r) + Pr ( \J Fj\ + Pr(J w+1 < to). 

\fe=0 / 

Also, as we noted above, the probability that some "initial" ball survives to 
time to is at most Ane , and so 

Pr (M to > j* + r + 1) < Pr (Z to {j* + 2) > r) + Ane"* . 

It follows on taking r = 1 that a.a.s. Mt < j* + 1; and on taking r = K + 3 
that 

Pr (M t0 > j* +K + 4)= o(n- K ~ 2 ). 

Now (9) in Lemma 2.1(b), say with r = n K and s = n~ 2 , yields the upper 
bound part of (2). 



28 



M. J. LUCZAK AND C. MCDIARMID 



8.3. Completing the proof. In this section d will be any fixed integer at 
least 2. The lower bound half of (2) will follow from (3), which we now 
prove. [See also (35) above.] Let < e < |, and let r = exp(n 1 ' s ~ e ). By- 
Lemma 6.1, there is a constant integer c > such that — c) > n 1 " 6 / 2 . By 
the concentration result Lemma 5.1 [applied to the function L(j* — c), with 
u = n 1 -"/ 2 ], 

Pr (M < j* - c) = Pr (L(j* - c) = 0) = exp(-0(n 1/3 " e/3 )). 

Now we may use inequality (8) in Lemma 2.1(b), with s = 1/r, a = j* — c — 3 
and 6 = 2, to show that a.a.s. Mj > j* — c— 2 for all £ 6 [0, r] . This completes 
the proof of (3). 

It remains to prove (4). Let z = z(n) be a positive integer such that 
In z = o(lnn). Note that balls choosing bin 1 on each of their d trials arrive 
in a Poisson process at rate An - ^" 1 ) (recall that balls choose bins with 
replacement). Let C be the event that, in the interval [t, t + 1), there are 
at least z balls which arrive, choose bin 1 each time, and survive at least to 
time t + 1. Then 

Pr (C t ) > (1 + o{l)){\n^ d - l h- l fe- z 

= exp(— (d — l)zlnn — z\nz + O(z)) 

= „-(d-l+o(l))* i 

Hence, 

Pr (M t (n) < z Vt G [0,t)) < Pr (X t (l) < z Vt G [0,r)) 

< Pr (each of Co, . . . , Ci r j_ 1 fail) 

<(l- n -{d-l+o{l))zy 
<exp(-rn-^ 1+ °( 1 )) 2 ). 

Hence, 

(38) Pr(M f (n) <z VtG [0,r))->0 as n ^ oo if rn" (d ~ 1+o(1))z ^ oo. 
This yields (4). 

9. Chaoticity. As usual, fix a positive integer d: let us assume here that 
d>2. One consequence of our concentration results is that asymptotically, 
as n — > oo, individual bin loads become independent of one another. Thus, 
our network satisfies the chaos hypothesis, Boltzmann's stosszahlansatz [6]. 
In recent years chaoticity phenomena have received considerable attention [6, 
9, 10, 18] in the context of various multitype particle systems, such as com- 
puter and communication networks, and interacting physical and chemical 
processes. Consider the equilibrium case. 
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Proposition 9.1. Fix an integer r>2. Consider the n-bin model, with 
load vector X in equilibrium. For any distinct indices j±, . . . , j r , the joint law 
of X(ji), . . . ,X(j r ) differs from the product law by at most 0(n _1 ln 4 n) in 
total variation. 

Proof. As before, let u(k,X) denote the fraction of bins with load at 
least k. By Lemma 5.2, 

sup Pr (\u(k, X) -u{k)\ > n~ 1/2 ln 3/2 n) = (Din" 1 ). 

k 



Hence, for each positive integer a < r, 



(39) sup E 



\ \u(k s ,X) - u(k s 



Ls=l 



Q(n^ 2 ln 3a / 2 n) + 0(n~ 1 ), 



where the supremum is over all a-tuples k±, . . . ,k a of nonnegative integers 
(not necessarily distinct). But 



E 



Ju(k s ,X) 



s=l 



\ u(k s 



s=l 



E E 

AC{l,...,r},|A|>2 



U(u(k s ,X)-u(k s )) 



n 

J se{l,...,r}V4 



Hence, by (39), uniformly over all r-tuples k 



i , . . . , kf , 



E 



~[u(k s ,X) 



Y[u(k s ) 



s=l 



(40) 



< E E 

AC{l,...,r},\A\>2 
= 0(n" 1 ln 3 n). 



J \u(k s ,X) - u(k s 



Now 



Thus, 



u 



( k ' x ) = -J2 1 xu)>k- 



E 



\u{k s ,X) 



s=l 



n" r E 



r n 

nE 

s=lj=l 



E 



.5 = 1 



X(s)>k s 



+ 0{n~ 1 ) 
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uniformly over all r-tuples k\ , . . . , k r , since when we expand the middle ex- 
pression, there are 0(n r ~ 1 ) terms for which the values of j are not all dis- 
tinct. Hence, from (40), 



sup 



E 



IT* 



X(s)>k s 



I u(k s 



.5 = 1 



0(n _1 In 3 n). 



But 



Pr /\{X{s)=k s ) 



E 



,s=l 



Ls=l 



L xO)>fc s +i, 



which is sum of 2 r terms ±E[TIs = i lx(s)>fcj> where k' s = k s or fc s + 1; and 
ris=i P f = ^s) is a corresponding sum of terms ± JYs=i U (K)- Hence, 

for any set j±, . . . ,j r of distinct bin indices, 



(41) sup 



Pr f\(X(j s ) = k s ))-l[Pr(X(j s 

\s=l / s=l 

But there exists a constant c > such that 



ks 



0(n _1 In 3 n). 



Pr ^maxX(j) > lnlnn/lnci + cj = 0(n 1 ). 

Hence, for any set ji, . . . ,j r of distinct bin indices, the joint law of X(j\), . . . , X(j r ) 
differs from the product law by at most 0(re _1 In 3 n(lnlnn) r ) in total vari- 
ation. □ 



The last result, together with the rapid mixing result Theorem 1.1, shows 
that, with suitable initial conditions, bin loads will be nearly independent 
after a short time. Suppose, for example, that we start with all bins empty 
or, more generally, with 0(n) balls in total, and let t = t(n) > 2 Inn. Let 
ji, . . . ,j r be fixed distinct indices, where r > 2. Then if Y has the equilibrium 
distribution, 

drvMXtfo), . . .,X t (j r )),C(X t (ji)) <8> • • • ® C{X t (j r ))) 
< <hv(£(Y(ji), . . . ,Y{ ]r )),C{Y{h)) ® ■ • ■ ® £(Y(j r ))) 

+ (r + l)d TV (C(X t ),C(Y)) 
= 0(n" 1 ln 4 n). 

10. Concluding remarks. We have investigated a natural continuous- 
time balls-and-bins model with d random choices, which exhibits the "power 
of two choices" phenomenon. We found that the system converges rapidly 
to its equilibrium distribution; in equilibrium, the maximum load is a.a.s. 
concentrated on just two values; when d = 1, these values are close to 
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Inn/ In Inn; and when d>2, they are close to In Inn/ In d, and the max- 
imum load varies little over polynomial length intervals. We make three 
further remarks: 

(a) We have not discussed the next level of detail. For example, for given 
values of d > 2 and A > 0, let m(d,X;n) denote the median value of 
the maximum load in equilibrium. We know that the difference 
m(d, A; n) — lnlnn/lnd stays bounded as n — ► oo, but how does it behave 
in more detail? How does it depend on A? 

(b) Our approach can be applied, in a natural way, to the "original" load- 
balancing problem, where m ~ cn balls are thrown into n bins sequen- 
tially, each ball chooses d random bins, and is placed into a least loaded 
of these bins, see [2, 3, 16]. It is known that, with probability tending to 
1 as n — > oo, at the end of the allocation process the maximum load of a 
bin is lnlnn/lnd + 0(1), though it has not been possible to determine 
the behavior of the 0(1) term. We make a step forward here, and see 
that the maximum load is concentrated on at most two values, as in the 
processes considered earlier in this paper. 

We embed the process in continuous time, and for the n-bin case, 
we assume that balls arrive in a Poisson process of rate n. A natural 
coupling, combined with the bounded differences method, yield concen- 
tration of measure for Lipschitz functions. As before, let (X t ) denote 
the loads process, let u(i,x) be the proportion of bins with load at least 
i in state x, and let u t (i) = E[u(i,Xt)]. Let to > be a fixed time. Then 
uniformly over t S [0,io] an d over i G N, 

du t (i) 



dt 



B[u(i-l,X t ) d ]-B[u(i,X t ) d ] 

= u t (i - l) d - u t {i) d + 0{n~ l In 2 n). 
Let (v(t, i) : i = 0, 1, . . .) solve the system of differential equations 

(42) ^ = v(t,i-l) d -v(t,i) d 

subject to v(t, 0) = 1 for each t > and v(0, i) = for each i = 1, 2, 

Then, using Gronwall's lemma (see, e.g., [8]), 

sup sup \ut(i) — v(t, i)\ = 0(n _1 In 2 n). 

0<t<t i€N 

Defining j* = j*(n) to be the least positive integer i such that v(t,i) < 
2?i~ 1 / 2 hm, with high probability, the maximum load of a bin when 
about tn balls have been thrown will equal j* — 1 or j* when d > 3, and 
will equal j* or j* + 1 when d = 2. Note that j* is defined purely in 
terms of the solution to the limiting differential equation (42). 
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(c) Our methods can be adapted to handle the "supermarket" model. In 
this well-studied queueing model, see, for example, [15, 17, 23], there 
are n single-server queues, with service times which are independent 
exponentials with mean 1; customers (balls) arrive in a Poisson stream 
at rate An, where < A < 1, and go to a shortest of d randomly chosen 
queues. In [11] we are able to determine (for the first time) the behavior 
of the maximum queue length, and, indeed, we obtain similar results to 
those in the present paper. It is possible also to analyze queues with a 
number s = s(n) of servers, not just 1 or oo. 

Acknowledgment. We are grateful to a referee for a very detailed reading 
of the paper. 
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